Observability: Metrics or Logs, Which is Truly Enough?
Find the balance between metrics and logs on your system observability journey. In which situations is each more effective? I analyze with my experience.
12 posts found.
Find the balance between metrics and logs on your system observability journey. In which situations is each more effective? I analyze with my experience.
What RED metrics are, when they are needed, and whether they are always comprehensive...
Determine which system monitoring method, agent-based or agentless, is right for you in 3 simple steps. A practical guide based on my experience.
Mustafa Erbay shares his experiences on the importance, usage, and practical tips for metric and trace data to deeply understand system issues…
Correctly setting log levels in our systems requires striking a critical balance between detailed monitoring and reducing unnecessary noise. This…
What should be considered when defining a log level strategy in production environments? Which log level should be used when? I'll explain with my experiences.
Effective management of log levels is critical for system health and troubleshooting processes. In this article, we explore the necessity of the debug level.
How often should you patch kernel CVEs while meeting your SLA commitments? I took a deep dive into the costs and risks involved.
Ensuring data integrity in AI-powered content pipelines is critical. I'll share practical approaches, from ingestion to output, for issues I've encountered in.
I explain step-by-step how to write robust health checks (HEALTHCHECK) for situations where Docker containers appear 'up' but the application isn't actually.
An old internal load balancer fails unexpectedly — and shapes the technical and career-defining test it puts an engineer through.
A detailed look at the 'zombie process' problem in production environments and how to analyze and resolve this hidden form of resource waste.